Abstract

We show bounds on total variation and $L^\infty$ mixing times, spectral gap and magnitudes of the
complex valued eigenvalues of a general (non-reversible non-lazy) Markov chain with a minor expansion
property. This leads to the first known bounds for the non-lazy simple and max-degree walks on
a (directed) graph, and even in the lazy case they are the first bounds of the optimal order. In
particular, it is found that within a factor of two or four, the worst case of each of these mixing
time and eigenvalue quantities is a walk on a cycle with clockwise drift.
1 Introduction

Markov chains are a key tool in approximation algorithms for combinatorial counting problems and for 
sampling from discrete spaces. Surprisingly, little is known about the convergence rate of a Markov 
chain with no holding probability. Even for the lazy simple random walk (i.e. strongly aperiodic 
nearest neighbor walk) on an undirected graph the order of magnitude for the slowest mixing time is 
not known. 

More specifically, consider an undirected graph with $m$ edges, $n$ vertices and maximum degree $d$.
The lazy simple random walk is known to converge in $O(m^2 \log(m/\epsilon))$ steps, and so the lazy
max-degree walk mixes in time $O(n^2 d^2 \log(nd/\epsilon))$ as well. However, there are no examples for
which either bound is of the correct order. We solve this riddle by giving new bounds for simple and
max-degree walks which are better than these. Moreover, our bounds apply to directed graphs, require no
holding probability, and are nearly sharp.

To state our results, recall that an Eulerian graph is a strongly connected directed graph such that 
each vertex has the same in and out-degrees. This is the natural directed analog of an undirected 
graph, as any undirected graph can be made into an Eulerian graph by replacing each undirected edge 
with two directed edges. Two natural walks on a graph will be considered. For the simple random 
walk choose a neighbor uniformly at random and go there, while in the max-degree walk choose a 
neighbor with probability 1/d each and otherwise do nothing. 
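
To make the two walks concrete, here is a minimal Python sketch. It assumes the graph is supplied as a dictionary of out-neighbor lists (parallel edges repeated); the helper names `simple_walk`, `max_degree_walk` and `is_stationary` are illustrative and do not come from the paper. It builds both kernels in exact rational arithmetic and checks that the simple walk has stationary distribution deg(x)/m while the max-degree walk is uniform.

```python
from fractions import Fraction

def _row(nbrs, denom):
    """Put probability 1/denom on each listed out-edge."""
    row = {}
    for y in nbrs:
        row[y] = row.get(y, Fraction(0)) + Fraction(1, denom)
    return row

def simple_walk(adj):
    """Simple random walk: move along a uniformly chosen out-edge."""
    return {x: _row(nbrs, len(nbrs)) for x, nbrs in adj.items()}

def max_degree_walk(adj):
    """Max-degree walk: each out-edge has probability 1/d, otherwise hold."""
    d = max(len(nbrs) for nbrs in adj.values())
    P = {}
    for x, nbrs in adj.items():
        row = _row(nbrs, d)
        hold = 1 - sum(row.values())
        if hold > 0:
            row[x] = row.get(x, Fraction(0)) + hold
        P[x] = row
    return P

def is_stationary(P, pi):
    """Check pi P = pi exactly."""
    out = {y: Fraction(0) for y in P}
    for x, row in P.items():
        for y, p in row.items():
            out[y] += pi[x] * p
    return all(out[y] == pi[y] for y in P)

# Undirected path 0 - 1 - 2, each edge replaced by two directed edges (m = 4).
adj = {0: [1], 1: [0, 2], 2: [1]}
m = sum(len(nbrs) for nbrs in adj.values())
pi_simple = {x: Fraction(len(nbrs), m) for x, nbrs in adj.items()}
pi_uniform = {x: Fraction(1, len(adj)) for x in adj}
```

Exact fractions are used so that the stationarity check is an equality test rather than a floating point comparison.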

Which (non-lazy) directed walks mix rapidly? Certainly the walk should not get stuck drifting 
between sets of equal sizes, such as from one bipartition to another (e.g. simple walk on a cycle with 
an even number of vertices). To avoid this it suffices that if the walk starts in a set of size $x \le 1/2$,
then the set of adjacent vertices has size $> x$. This holds, for instance, for a max-degree walk on a
strongly connected graph with a self-loop at each vertex.
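
The expansion requirement can be tested by brute force on small examples. The sketch below assumes a uniform stationary distribution, so that set sizes are just cardinalities; `cycle_walk`, `neighborhood` and `expands_uniform` are illustrative names, and the search is exponential in the number of vertices.

```python
from itertools import combinations

def cycle_walk(n):
    """Simple random walk on the n-cycle as a dict of transition rows."""
    return {x: {(x + 1) % n: 0.5, (x - 1) % n: 0.5} for x in range(n)}

def neighborhood(P, A):
    """N(A) = {y : P(x, y) > 0 for some x in A}."""
    return {y for x in A for y, p in P[x].items() if p > 0}

def expands_uniform(P):
    """True if |N(A)| > |A| for every nonempty A with |A| <= |V|/2."""
    V = list(P)
    for k in range(1, len(V) // 2 + 1):
        for A in combinations(V, k):
            if len(neighborhood(P, A)) <= k:
                return False
    return True
```

On an even cycle the set of even-numbered vertices is mapped onto the odd-numbered ones, a set of the same size, so the check fails; on an odd cycle no such set exists.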
We now give our main results. Note that $\tau(\epsilon)$ is total variation mixing time (time to converge
at an "average" vertex), $\tau_\infty(\epsilon)$ is $L^\infty$ mixing time (time to converge at every vertex),
$\lambda_i \neq 1$ is any non-trivial (complex-valued) eigenvalue of the transition matrix, $\lambda$ is the
spectral gap, and $N(A) = \{y : \exists x \in A,\ P(x,y) > 0\}$ is the neighborhood of set $A$.

Corollary 1.1. The simple random walk on an Eulerian graph with $m$ edges satisfies

$$\lambda \ge 1 - \cos\frac{2\pi}{m} .$$

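If it satisfies the expansion condition that

$$\forall A \subset V,\ \pi(A) \le 1/2,\ \forall v \in V :\ \pi(N(A) \setminus v) \ge \pi(A)$$

then also

$$1 - |\lambda_i| \ge 1 - \cos\frac{2\pi}{m} \approx \frac{2\pi^2}{m^2}$$

$$\tau(\epsilon) \le \frac{1}{-\log\cos\frac{2\pi}{m}}\,\log\frac{1 - 2/m}{\epsilon} \approx \frac{m^2}{2\pi^2}\log\frac{1}{\epsilon}$$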

. [ log^ + logl m2 m2 l\ 2 l + 3e . 

I -log cos ^ 6 8 ^1 ~ 3(1 -I- e)'^ 

m — 2 4 vn? 

~ 2^ + "8~ e ^6G[i/m,i] + (^^^^2 '^^>i 

For the lazy simple random walk the bound on $\lambda$ is a factor two smaller, the expansion condition
is replaced by strong connectivity, and in the remaining bounds replace $m$ by $2m$.

It follows that every lazy simple Eulerian walk converges in the same $\tau(\epsilon) = O(m^2 \log(1/\epsilon))$ steps
required for a cycle walk, improving on and generalizing the classical result $\tau(\epsilon) = O(m^2 \log(m/\epsilon))$
for a lazy simple undirected walk. This can be further improved on by an order of magnitude in the
special case of a walk on a regular graph, or equivalently of a max-degree walk.

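Corollary 1.2. The max-degree walk on an Eulerian graph with $n$ vertices and max-degree $d$ satisfies

$$\lambda \ge \frac{2}{d}\left(1 - \cos\frac{\pi}{n}\right) \approx \frac{\pi^2}{n^2 d} .$$

If it satisfies the expansion condition that

$$\forall A \subset V,\ |A| \le |V|/2 :\ |N(A)| > |A|$$

then also

$$1 - |\lambda_i| \ge \frac{2}{d}\left(1 - \cos\frac{\pi}{n}\right) \approx \frac{\pi^2}{n^2 d}$$

$$\tau(\epsilon) \le \frac{1}{-\log\left(1 - \frac{2}{d}\left(1 - \cos\frac{\pi}{n}\right)\right)}\,\log\frac{1 - 1/n}{\epsilon} \approx \frac{n^2 d}{\pi^2}\log\frac{1}{\epsilon}$$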

. / log(n - 1) + log ^ n^d n^d l\ 2 l + 3e 

"00(e) < mm<— — 27- — , — + — — log- > de<i -l-n d- — —— 3 de>i 

[-log(l- l(l-cos^)) 3 4 ej - 3(l + e)3 

n'^d . n-1 . n^d , 4 ^ 2n'^d , 

~ log 0^<l/n + — T log - ^<:e[l/n,l] + i-, , ^2 <^e>l 
For the lazy max-degree walk the bound on $\lambda$ is a factor two smaller, the expansion condition is
replaced by strong connectivity, and in the remaining bounds replace $d$ by $2d$.

Just how good are these bounds? For the simple random walk on the cycle with an odd number
of vertices $n$ (so $m = 2n$ and $d = 2$) the spectral gap bound is off by a factor of 4, the eigenvalue
bounds are exact, and the upper bounds on $\tau(\epsilon)$ become lower bounds if $\log\frac{1-\pi_*}{\epsilon}$ is replaced by
$\log\frac{1}{2\epsilon}$ (where $\pi_* = 2/m$ and $\pi_* = 1/n$ respectively). More generally, we define a precise notion of rate of
expansion, and show that a cycle walk with clockwise drift will be within a factor two of being the
slowest mixing, not only among simple or max-degree walks, but among all Markov chains with this
rate of expansion!

An interesting aspect of our argument is that it uses the Evolving set methodology of Morris and
Peres [H], in an improved form given by this author [3] which bounds total variation distance directly,
without going through $L^2$ distance. Related bounds also show that with relative entropy and $L^2$
mixing times the cycle walk is again asymptotically nearly the slowest walk.

The paper proceeds as follows. In Section 2 we review the Evolving set methodology. This is
followed in Section 3 with a proof of our main mixing result, a generalization of the simple and
max-degree Eulerian walks considered above. In Section 4 this is extended to a bound on convergence rates
in distances other than total variation. The Appendix contains proofs of inequalities used in showing
our results.

2 Review of Mixing and Evolving Sets 

We begin by reviewing mixing time theory, and particularly Evolving Set ideas. 

Let $P$ be a finite irreducible Markov kernel on state space $V$ with stationary distribution $\pi$, that
is, $P$ is a $|V| \times |V|$ matrix with entries in $[0,1]$, row sums are one, $V$ is connected under $P$
($\forall x,y \in V\ \exists t : P^t(x,y) > 0$), and $\pi$ is a distribution on $V$ with $\pi P = \pi$. The time-reversal $P^*$ is given by
$P^*(x,y) = \frac{\pi(y)P(y,x)}{\pi(x)}$ and has stationary distribution $\pi$ as well. If $A, B \subset V$ the ergodic flow from $A$ to
$B$ is given by $Q(A,B) = \sum_{x \in A,\, y \in B} \pi(x)P(x,y)$. Given initial distribution $\sigma$, the $t$-step discrete time
distribution is given by $\sigma P^t$.

If the walk is strongly connected and aperiodic then $\sigma P^t \to \pi$ as $t \to \infty$. Our goal is to measure the rate
of convergence. One of the more widely used notions of distance between distributions is the variation
distance, $\|\sigma - \pi\|_{TV} = \frac12 \sum_{x \in V} |\sigma(x) - \pi(x)|$.

The mixing time $\tau(\epsilon)$ denotes the worst-case number of steps required for the total variation distance
$\|P^t(x,\cdot) - \pi\|_{TV}$ to drop to $\epsilon$.
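
As an illustration of these definitions, the following sketch computes the total variation distance and the worst-case mixing time by direct iteration. It assumes a dense row-stochastic matrix given as a list of lists; the function names are illustrative, and the approach is only practical for small state spaces.

```python
def tv_distance(sigma, pi):
    """Total variation distance between two distributions on the same states."""
    return 0.5 * sum(abs(s - p) for s, p in zip(sigma, pi))

def step(sigma, P):
    """One step of the chain: returns the row vector sigma P."""
    n = len(P)
    return [sum(sigma[x] * P[x][y] for x in range(n)) for y in range(n)]

def mixing_time(P, pi, eps, t_max=100_000):
    """Smallest t with max over starting states of TV(P^t(x, .), pi) <= eps."""
    n = len(P)
    rows = [[1.0 if y == x else 0.0 for y in range(n)] for x in range(n)]
    for t in range(t_max + 1):
        if max(tv_distance(r, pi) for r in rows) <= eps:
            return t
        rows = [step(r, P) for r in rows]
    raise RuntimeError("did not mix within t_max steps")

def cycle_matrix(n):
    """Simple random walk on the n-cycle as a dense matrix."""
    P = [[0.0] * n for _ in range(n)]
    for x in range(n):
        P[x][(x + 1) % n] += 0.5
        P[x][(x - 1) % n] += 0.5
    return P
```

For the simple walk on the 5-cycle the distance from a point mass is 0.8, 0.6, 0.4, 0.35 after 0, 1, 2, 3 steps, which is what the thresholds in a quick check would pick up.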

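Many bounds on mixing time are shown by working with the spectral gap, which is just the gap
between the two largest eigenvalues of the walk $\frac{P+P^*}{2}$, that is,

$$\lambda = \min_{i > 0}\left(1 - \lambda_i\!\left(\frac{P+P^*}{2}\right)\right) = \inf_{\mathrm{Var}_\pi(f) \ne 0} \frac{\frac12 \sum_{x,y \in V} (f(x) - f(y))^2\,\pi(x)P(x,y)}{\mathrm{Var}_\pi(f)}$$

where $\{\lambda_i(K)\}$ denotes the eigenvalues of Markov chain $K$, and $\lambda_0(K) = 1$.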
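
For walks on a cycle the spectral gap can be computed in closed form, since the transition matrix is circulant. The sketch below is restricted to that assumption (a translation-invariant walk on Z_n with uniform stationary distribution, where the time-reversal simply reverses each step); the function names are illustrative.

```python
import cmath
import math

def circulant_eigenvalues(steps, n):
    """Eigenvalues of the walk on Z_n that moves by s with probability steps[s]."""
    return [sum(p * cmath.exp(2j * math.pi * k * s / n) for s, p in steps.items())
            for k in range(n)]

def additive_symmetrization(steps):
    """Step distribution of (P + P*)/2 when pi is uniform."""
    sym = {}
    for s, p in steps.items():
        sym[s] = sym.get(s, 0.0) + p / 2
        sym[-s] = sym.get(-s, 0.0) + p / 2
    return sym

def spectral_gap(steps, n):
    """1 minus the largest non-trivial eigenvalue of (P + P*)/2."""
    eig = circulant_eigenvalues(additive_symmetrization(steps), n)
    return 1 - max(ev.real for ev in eig[1:])
```

A walk with drift, such as steps `{1: 0.75, -1: 0.25}`, has the same additive symmetrization as the simple walk `{1: 0.5, -1: 0.5}`, so both have spectral gap 1 - cos(2π/n).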

Our results are based on a theorem of Montenegro [3], which is in turn proven by working with a
lower bound on mixing given in [2], and with the Evolving set methodology of Morris and Peres |Sj in
a stronger form given by Montenegro [3].
Theorem 2.1. Consider a finite, irreducible Markov chain. Given $f : [0,1] \to \mathbb{R}_+$ such that
$\forall a \in (0,1/2] : 0 < f(a) \le f(1-a)$, let the $f$-congestion $\mathcal{C}_f = \max_{A \subset V,\ \pi(A) \le 1/2} \mathcal{C}_f(A)$ where

$$\mathcal{C}_f(A) = \frac{\int_0^1 f(\pi(A_u))\,du}{f(\pi(A))}, \qquad A_u = \{v \in V : Q(A,v) \ge u\,\pi(v)\} .$$

Then, the t-step Markov chain satisfies

- Ai * < max P* X,- -vr TV < max \,\\,, max ^ / , 

while every (complex valued) eigenvalue $\lambda_i \neq 1$ of the transition probability matrix satisfies

$$1 - |\lambda_i| \ge 1 - \mathcal{C}_f .$$

If instead $f$ satisfies the weaker condition $\forall a \in (0,1) : f(a) > 0$, then the result still holds, but
with $\mathcal{C}_f = \max_{A \subset V} \mathcal{C}_f(A)$.

The proof is not yet in print, and so it is included in the Appendix to this paper. 

The $f$-congestion $\mathcal{C}_f(A)$ is a measure of the expansion (or congestion) of a random walk, and seems
particularly well suited towards showing geometric bounds on mixing times. While it is generally
not easy to calculate directly, the following lemma of [Hj makes it possible to bound $\mathcal{C}_f$ in terms of
isoperimetric quantities.
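
For a very small chain the $f$-congestion can nevertheless be evaluated exhaustively. The sketch below assumes the definitions $A_u = \{v : Q(A,v) \ge u\,\pi(v)\}$ and $\mathcal{C}_f(A) = \int_0^1 f(\pi(A_u))\,du / f(\pi(A))$, and uses the fact that $\pi(A_u)$ is a step function of $u$ with jumps at the ratios $Q(A,v)/\pi(v)$. All function names are illustrative, and the enumeration over subsets is exponential.

```python
from fractions import Fraction
from itertools import combinations
from math import sin, cos, pi as PI

def level_ratios(P, pi, A):
    """r(v) = Q(A, v) / pi(v), so that A_u = {v : r(v) >= u}."""
    flow = {v: Fraction(0) for v in P}
    for x in A:
        for v, p in P[x].items():
            flow[v] += pi[x] * p
    return {v: flow[v] / pi[v] for v in P}

def congestion(P, pi, A, f):
    """C_f(A): integrate the step function u -> f(pi(A_u)) exactly."""
    r = level_ratios(P, pi, A)
    cuts = sorted({t for t in r.values() if t > 0} | {Fraction(1)})
    total, prev = 0.0, Fraction(0)
    for t in cuts:
        # A_u is constant for u in (prev, t]
        mass = sum((pi[v] for v in P if r[v] >= t), Fraction(0))
        total += float(t - prev) * f(float(mass))
        prev = t
    return total / f(float(sum(pi[x] for x in A)))

def max_congestion(P, pi, f):
    """C_f = max of C_f(A) over nonempty A with pi(A) <= 1/2."""
    V = list(P)
    best = 0.0
    for k in range(1, len(V)):
        for A in combinations(V, k):
            if sum(pi[x] for x in A) <= Fraction(1, 2):
                best = max(best, congestion(P, pi, A, f))
    return best

def f_sin(a):
    return sin(PI * a)

# Simple random walk on the 5-cycle with uniform stationary distribution.
n = 5
P = {x: {(x + 1) % n: Fraction(1, 2), (x - 1) % n: Fraction(1, 2)} for x in range(n)}
pi = {x: Fraction(1, n) for x in range(n)}
C = max_congestion(P, pi, f_sin)
```

For this walk the maximum is attained at single vertices and equals cos(π/5), which is also the largest magnitude of a non-trivial eigenvalue, so the eigenvalue bound $1 - |\lambda_i| \ge 1 - \mathcal{C}_f$ is tight here.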

Lemma 2.2. Given a concave function $f : [0,1] \to \mathbb{R}$ and two non-increasing functions $g, \hat g : [0,1] \to [0,1]$
such that $\int_0^1 g(u)\,du = \int_0^1 \hat g(u)\,du$ and $\forall t \in [0,1] : \int_0^t g(u)\,du \ge \int_0^t \hat g(u)\,du$, then

$$\int_0^1 f \circ g(u)\,du \le \int_0^1 f \circ \hat g(u)\,du .$$

Our interest is in bounding $\int_0^1 f(\pi(A_u))\,du$ for a concave function $f$. To apply the lemma we
minimize the integral $\int_0^t \pi(A_u)\,du$, recalling that $\pi(A_u)$ is a decreasing function of $u$ and noting that
$\int_0^1 \pi(A_u)\,du = \pi(A)$, while taking into account whatever constraints are given by the problem of
interest. A particularly useful constraint to consider is the modified ergodic flow

$$\forall A \subset V :\ \Psi(A) = \frac12 \int_0^1 |\pi(A_u) - \pi(A)|\,du .$$

By the property $\int_0^1 \pi(A_u)\,du = \pi(A)$ it follows that $\Psi(A)$ is the area below $\pi(A_u)$ and above $\pi(A)$,
while also the area below $\pi(A)$ and above $\pi(A_u)$. See Figures 1 and 2 for two such examples.

A useful interpretation of $\Psi(A)$ is as the smallest ergodic flow from set $A$ to a set $B$ of size
$\pi(B) = \pi(A^c)$ (see 0), that is

$$\Psi(A) = \min_{\substack{B \subset V,\ v \in V, \\ \pi(B) \le \pi(A^c) < \pi(B \cup v)}} Q(A,B) + \frac{\pi(A^c) - \pi(B)}{\pi(v)}\,Q(A,v) .$$

When the distribution $\pi$ is uniform then this simplifies to

$$\Psi(A) = \min_{\substack{B \subset V, \\ \pi(B) = \pi(A^c)}} Q(A,B) ,$$

while in general if the walk is lazy (i.e. $\forall x \in V : P(x,x) \ge 1/2$) then $\Psi(A) = Q(A,A^c)$ with the worst
set $B$ being $B = A^c$.
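
The two descriptions of the modified ergodic flow can be compared numerically. The sketch below assumes a uniform stationary distribution and evaluates both the area form $\frac12\int_0^1 |\pi(A_u) - \pi(A)|\,du$ and the minimum of $Q(A,B)$ over sets with $|B| = |A^c|$, in exact arithmetic. The function names are illustrative and the minimization is by brute force.

```python
from fractions import Fraction
from itertools import combinations

def flow_to(P, pi, A, v):
    """Ergodic flow Q(A, v) = sum over x in A of pi(x) P(x, v)."""
    return sum((pi[x] * P[x].get(v, 0) for x in A), Fraction(0))

def psi_area(P, pi, A):
    """(1/2) * integral over u in [0, 1] of |pi(A_u) - pi(A)|."""
    r = {v: flow_to(P, pi, A, v) / pi[v] for v in P}
    pA = sum(pi[x] for x in A)
    cuts = sorted({t for t in r.values() if t > 0} | {Fraction(1)})
    total, prev = Fraction(0), Fraction(0)
    for t in cuts:
        # pi(A_u) is constant for u in (prev, t]
        mass = sum((pi[v] for v in P if r[v] >= t), Fraction(0))
        total += (t - prev) * abs(mass - pA)
        prev = t
    return total / 2

def psi_min_flow_uniform(P, pi, A):
    """min of Q(A, B) over |B| = |A^c|; only meaningful for uniform pi."""
    V = list(P)
    size = len(V) - len(A)
    return min(sum(flow_to(P, pi, A, y) for y in B) for B in combinations(V, size))

# Simple random walk on the 5-cycle (m = 10 directed edges).
n = 5
P = {x: {(x + 1) % n: Fraction(1, 2), (x - 1) % n: Fraction(1, 2)} for x in range(n)}
pi = {x: Fraction(1, n) for x in range(n)}
```

On this example a single vertex has modified ergodic flow exactly 1/10, that is 1/m.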

Now, what is a good choice of function $f$ for the $f$-congestion? In [4] it was suggested that if
the modified ergodic flow $\Psi(A) \ge C$, $\forall A \subset V$ for some constant $C$ not depending on set size, such
as the simple random walk on an odd length cycle on $n$ vertices (with $\Psi(A) \ge 1/2n$), then it is best to work with
$f(a) = \sin(\pi a)$. In this case Theorem 2.1 implies

$$\|P^t(x,\cdot) - \pi\|_{TV} \le (1 - \pi_*)\,\mathcal{C}^t_{\sin(\pi a)} \quad\text{and}\quad 1 - |\lambda_i| \ge 1 - \mathcal{C}_{\sin(\pi a)} ,$$

where $\pi_* = \min_{v \in V} \pi(v)$.



3 General random walks 



We now set out to show our main result, eigenvalue and total variation mixing bounds for general
random walks (the $L^\infty$ case will be dealt with in the next section). Two corollaries of this will be the
specific walks on Eulerian graphs discussed in the introduction. In particular, we will find that even
when general Markov chains are considered, a walk with clockwise drift on a cycle is still within a
factor two of being the slowest mixing Markov chain.

Recall from the preliminaries that $\Psi(A)$ is both a measure of area between the curves $y = \pi(A_u)$
and $y = \pi(A)$, and also a measure of the worst ergodic flow from $A$ into a set of size $\pi(A^c)$. While this
will play a key role in our proof, our main theorem will involve a slightly weaker quantity. In practice
these two will usually be the same. Given $A \subset V$, let $\tilde Q(A,x) = \min\{Q(A,x), \pi(x)/2\}$ and define



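$$\tilde\Psi(A) = \min_{\substack{B \subset V,\ v \in V, \\ \pi(B) \le \pi(A^c) < \pi(B \cup v)}} \sum_{x \in B} \tilde Q(A,x) + \frac{\pi(A^c) - \pi(B)}{\pi(v)}\,\tilde Q(A,v) .$$

As with $\Psi(A)$, when the distribution is uniform then this is just

$$\tilde\Psi(A) = \min_{\pi(B) = \pi(A^c)} \tilde Q(A,B), \qquad\text{where } \tilde Q(A,B) = \sum_{x \in B} \tilde Q(A,x) .$$

Moreover, if the walk is lazy then $\tilde\Psi(A) = \tilde Q(A,A^c) = Q(A,A^c) = \Psi(A)$, or if $\Psi(A) \le A_{min}/2$
(defined below) then again $\tilde\Psi(A) = \Psi(A)$.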

To motivate the form of our main result, we note that in their work on Blocking conductance
Kannan, Lovász and Montenegro [1] show that the square of conductance can often be replaced by
a product of a measure of vertex boundary and a measure of edge expansion. Likewise, our general
bound will involve a product of edge expansion with a measure of vertex boundary $A_{max}$, rather
than just the square of edge expansion which is found in most isoperimetric results.

Theorem 3.1. Given a finite Markov chain, let 



Qr 



min ^(>1) Ar 

7r(A)<l/2 
T(A)^7r(ij) 

minQ(^,^'') ir^ 

AcV 



min{ifmin, Amm/2} 

max{^'mm) ^min/2} 



minvrfx) 

xev 
Then,



"^i^) ^ 7 i : ^ log 



log ( 1 - 2-^" 



(1 - COs(27r Arnax))^ 



1 , 1 - vr* 
log 



A 

l-|Ai| > 2 ™" (1 - COs(27r Amax)) - 27r^ 



\ \ '^Qmin /-. I \\ ^ 2 r\ 

A > (1 - COS(7r7r^,)) PS TT ■nMmin 

Proof of Corollaries 1.1 and 1.2 (see Section 4 for the $L^\infty$-bounds). First to Corollary 1.1. Suppose that
$\pi(A) \le 1/2$, and $B \subset V$ satisfies $\Psi(A) = Q(A,B) + \frac{\pi(A^c) - \pi(B)}{\pi(v)}\,Q(A,v)$. If $N(A) \subset B^c$ then
$\pi(N(A) \setminus v) \le \pi(B^c \setminus v) = 1 - \pi(B \cup v) < \pi(A)$, contradicting the expansion condition. Hence,
$N(A) \cap B \ne \emptyset$ and so $\exists x \in A$, $y \in B$ with $P(x,y) > 0$, and so $\Psi(A) \ge Q(A,B) \ge \pi(x)P(x,y) \ge 1/m$.
Likewise, for some $B \subset V$, $\tilde\Psi(A) \ge \tilde Q(A,B) \ge \min\{Q(x,y), \pi(y)/2\}$ and so $\tilde\Psi(A) \ge 1/m$ if $\pi(y) =
\deg(y)/m \ge 2/m$. If $\deg(y) = 1$ then $N(\{y\})$ has only a single vertex $v$, and so $\pi(N(\{y\}) \setminus v) = 0$,
contradicting the expansion condition. It follows that $\pi(y) \ge \pi_* \ge 2/m$. Corollary 1.1 then follows
from Theorem 3.1 and the bound $A_{min} \ge 1/m$. Corollary 1.2 follows similarly, but with $\tilde\Psi(A) \ge 1/nd$
and $A_{min} = 1/n$. □

Note that the max-degree walk is actually the same as the simple random walk when each vertex $x$
has $d - \deg(x)$ self-loops added, and yet Corollary 1.2 is much better than that induced by Corollary 1.1.
To understand this, recall that, in keeping with the intuition of Blocking Conductance, Theorem 3.1
will greatly improve on a bound involving edge-expansion alone (i.e. $\Psi(A)$ or $\tilde\Psi(A)$) if $A_{min} \gg \tilde\Psi_{min}$.
In fact, the max-degree walk had $A_{min} = 1/n \gg 1/nd = \tilde\Psi_{min}$.

The theorem gets us very close to answering the question of what is the worst of all random walks,
as shown by the following examples.

Example 3.2. Consider the simple random walk on a cycle (with $m = 2n$ edges). Since $m = 2n$ and
$d = 2$ then Corollaries 1.1 and 1.2 are the same. In Example 3.3 we find that Corollary 1.2 is exact
for the eigenvalue gap and essentially sharp for mixing times, and hence Corollary 1.1 is equally good.

Example 3.3. Consider a max-degree walk on a cycle with an odd number of vertices $n$, such that
at each vertex there are $d - 1$ edges pointing in the clockwise direction, and 1 edge pointing in the
counterclockwise direction.

This walk has an eigenvalue $\lambda_k = \frac{d-1}{d}\,e^{\pi i (n-1)/n} + \frac{1}{d}\,e^{-\pi i (n-1)/n}$ with eigenvector $f(x) = e^{\pi i (n-1) x/n}$
where the vertices are labeled clockwise as $x \in \{0, 1, \ldots, n-1\}$. Then

$$1 - |\lambda_k| = 1 - \sqrt{1 - \frac{4}{d}\left(1 - \frac{1}{d}\right)\sin^2\frac{\pi(n-1)}{n}} \approx \frac{2}{d}\left(1 - \frac{1}{d}\right)\left(\frac{\pi}{n}\right)^2 \le \frac{2\pi^2}{n^2 d} .$$

Corollary 1.2 gives a fairly similar bound of

$$\min_k\ 1 - |\lambda_k| \ge \frac{2}{d}\left(1 - \cos(\pi/n)\right) \approx \frac{\pi^2}{n^2 d} .$$

The upper and lower bounds are equal at $d = 2$, and within a factor two of equality when $d > 2$.
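
The comparison in this example is easy to reproduce numerically. The sketch below assumes the drifting walk has $P(x,x+1) = (d-1)/d$ and $P(x,x-1) = 1/d$ on an odd cycle, so that its eigenvalues follow from the circulant structure; the function names are illustrative.

```python
import cmath
import math

def drift_eigenvalues(n, d):
    """Eigenvalues of the walk on Z_n with P(x,x+1) = (d-1)/d and P(x,x-1) = 1/d."""
    a, b = (d - 1) / d, 1 / d
    return [a * cmath.exp(2j * math.pi * k / n) + b * cmath.exp(-2j * math.pi * k / n)
            for k in range(n)]

def eigenvalue_gap(n, d):
    """min over non-trivial eigenvalues of 1 - |lambda_k|."""
    return min(1 - abs(ev) for ev in drift_eigenvalues(n, d)[1:])

def closed_form_gap(n, d):
    """1 - sqrt(1 - (4/d)(1 - 1/d) sin^2(pi (n-1)/n)), from the k = (n-1)/2 eigenvalue."""
    s = math.sin(math.pi * (n - 1) / n)
    return 1 - math.sqrt(1 - (4 / d) * (1 - 1 / d) * s * s)

def corollary_bound(n, d):
    """The lower bound (2/d)(1 - cos(pi/n)) quoted from Corollary 1.2."""
    return (2 / d) * (1 - math.cos(math.pi / n))
```

With n = 7 the true gap and the bound coincide at d = 2, while at d = 3 the true gap lies between the bound and twice the bound.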






For spectral gap, note that $\frac{P+P^*}{2}$ is just the simple random walk on a cycle, and the largest
non-trivial eigenvalue of this is $\cos(2\pi/n)$. Consequently $\lambda = 1 - \cos(2\pi/n) \approx \frac{2\pi^2}{n^2}$. By Theorem 3.1 every walk
with $Q_{min} = 1/n$ and $\pi_* = 1/n$ satisfies $\lambda \ge 2(1 - \cos(\pi/n)) \approx \frac{\pi^2}{n^2}$ and so our drifting walk is within a
factor two of having the worst spectral gap among all walks with $Q_{min} = 1/n$ and $\pi_* = 1/n$. Although
Corollary 1.2 is quite poor for this example, it is only off by a factor of four when considering instead
the simple random walk on a cycle with $d - 2$ self-loops (whose largest non-trivial eigenvalue is $1 - \frac{2}{d}(1 - \cos(2\pi/n))$).
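Likewise, the upper and lower bounds on mixing time are quite similar:

$$\tau(\epsilon) \ge \frac{1}{-\log|\lambda_k|}\log\frac{1}{2\epsilon} = \frac{1}{-\log\sqrt{1 - \frac{4}{d}\left(1 - \frac{1}{d}\right)\sin^2\frac{\pi(n-1)}{n}}}\,\log\frac{1}{2\epsilon} \approx \frac{n^2 d}{2\pi^2 (1 - 1/d)}\log\frac{1}{2\epsilon}$$

while the upper bound is

$$\tau(\epsilon) \le \frac{1}{-\log\left(1 - \frac{2}{d}(1 - \cos(\pi/n))\right)}\,\log\frac{1 - 1/n}{\epsilon} \approx \frac{n^2 d}{\pi^2}\log\frac{1}{\epsilon} .$$

The bounds are nearly equivalent at $d = 2$, and within a factor two of equality when $d > 2$. When
$n = 3$ and $d = 2$ then the lower bound can be sharpened slightly to be exactly equal to the upper
bound.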

Example 3.4. Consider a general Markov chain. Note that if vertex $v \in V$ has $\pi(v) = \pi_*$ then
$\tilde\Psi_{min} \le \tilde\Psi(\{v\}) \le \pi_*(1 - \pi_*)$. The clockwise Markov chain given above had $\tilde\Psi_{min} = 1/nd = \pi_*/d$. If
instead the walk has transitions $P(x,x+1) = \alpha \in [1/2,1]$ and $P(x,x-1) = 1 - \alpha$ then $\tilde\Psi_{min} = (1 - \alpha)\pi_*$.
If $\tilde\Psi_{min} \le \pi_*/2$ then when $\alpha = 1 - \tilde\Psi_{min}/\pi_* \ge 1/2$ the upper and lower bounds in Theorem 2.1 are
again within a factor 2 or 4 from the correct values.

Proof of Theorem 3.1. As suggested in the preliminaries, we will study the $f$-congestion $\mathcal{C}_{\sin(\pi a)}$, via
Lemma 2.2. This will be done in two steps. First, we show a result appropriate for max-degree random
walks. Then we consider a case relevant to the simple random walk.
Fix some set $A \subset V$.

First consider the case that $\Psi(A) < A_{min}/2$.

Notice that if $\pi(A_u) > \pi(A)$ then $\pi(A_u) \ge \pi(A) + A_{min}$, while if $\pi(A_u) < \pi(A)$ then $\pi(A_u) \le
\pi(A) - A_{min}$. In Figure 1 let the solid line sketch the curve $\pi(A_u)$, and note that the dashed line $m(u)$
encloses the same area $\Psi(A)$ but decreases the integral, so $\int_0^t m(u)\,du \le \int_0^t \pi(A_u)\,du$.




Figure 1: If $\Psi(A) < A_{min}/2$ then $\int_0^t m(u)\,du \le \int_0^t \pi(A_u)\,du$ and $\int_0^1 m(u)\,du = \int_0^1 \pi(A_u)\,du$.
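The curve $m(u)$ is given by

$$m(u) = \begin{cases} \pi(A) + A_{min} & \text{if } u < \Psi(A)/A_{min} \\ \pi(A) & \text{if } u \in (\Psi(A)/A_{min},\ 1 - \Psi(A)/A_{min}) \\ \pi(A) - A_{min} & \text{if } u > 1 - \Psi(A)/A_{min} . \end{cases}$$

By Lemma 2.2, since $\int_0^1 m(u)\,du = \int_0^1 \pi(A_u)\,du = \pi(A)$, it follows that if $\pi(A) = x$ then

$$\mathcal{C}_{\sin(\pi a)}(A) \le \frac{\int_0^1 \sin(\pi m(u))\,du}{\sin(\pi x)} = 1 - 2\,\frac{\Psi(A)}{A_{min}}\,(1 - \cos(\pi A_{min})) \le 1 - 2\,\frac{\tilde\Psi_{min}}{A_{min}}\,(1 - \cos(\pi A_{min})) .$$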



Now, consider the case that $\Psi(A) \ge A_{min}/2$.

In Figure 2 let the solid line in the left diagram sketch the curve $\pi(A_u)$. If this has portions above
$y = \pi(A)$ but beyond $u = 1/2$ then truncate these off, with the portion below $y = \pi(A)$ raised slightly
to keep the enclosed area constant, making the left figure into the right side one. Note that the dashed
line $m(u)$ encloses the same area $\tilde\Psi(A)$ but decreases the integral, so $\int_0^t m(u)\,du \le \int_0^t \pi(A_u)\,du$ and
$\int_0^1 m(u)\,du = \int_0^1 \pi(A_u)\,du = \pi(A)$.



Figure 2: Curve $m(u)$ satisfies $\int_0^t m(u)\,du \le \int_0^t \pi(A_u)\,du$ and $\int_0^1 m(u)\,du = \int_0^1 \pi(A_u)\,du$.



The curve $m(u)$ is such that, if $p \in [0,1/2]$ denotes the value of $u$ where $m(u)$ crosses the line
$y = \pi(A)$, that is $m(u) \ge \pi(A)$ if $u < p$ and $m(u) \le \pi(A)$ if $u > p$, then

$$m(u) = \begin{cases} \pi(A) + \frac{\tilde\Psi(A)}{p} & \text{if } u < p \\ \pi(A) - \frac{\tilde\Psi(A)}{1-p} & \text{if } u > p . \end{cases}$$

By Lemma 2.2 it follows that, if $g(u) = \pi(A_u)$ and $x = \pi(A)$ then

$$\int_0^1 \sin(\pi g(u))\,du \le \int_0^1 \sin(\pi m(u))\,du = p\,\sin\!\left(\pi\left(x + \frac{\tilde\Psi(A)}{p}\right)\right) + (1-p)\,\sin\!\left(\pi\left(x - \frac{\tilde\Psi(A)}{1-p}\right)\right) \le \sin(\pi x)\cos(2\pi\tilde\Psi(A)) ,$$

where the final inequality is from Lemma 4.4 in the Appendix. Hence, $\mathcal{C}_{\sin(\pi a)}(A) \le \cos(2\pi\tilde\Psi(A))$.

Combine these two cases, maximize over sets $A \subset V$, and apply Theorem 2.1 to obtain the mixing
time and eigenvalue bounds.

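For the spectral gap, note that

$$\lambda = 2\min_{i>0}\left(1 - \lambda_i\!\left(\frac{I}{2} + \frac{P+P^*}{4}\right)\right) = 2\min_{i>0}\left(1 - \left|\lambda_i\!\left(\frac{I}{2} + \frac{P+P^*}{4}\right)\right|\right) .$$

This shows it suffices to study eigenvalues of $P' = \frac{I}{2} + \frac{P+P^*}{4}$. However, $P'$ is a lazy walk, and so
$\tilde\Psi(A) = \Psi(A) = Q_{P'}(A,A^c)$. This is in turn half the ergodic flow $Q_{\frac{P+P^*}{2}}(A,A^c)$, and so $\tilde\Psi(A) =
\frac12 Q_{\frac{P+P^*}{2}}(A,A^c) = \frac12 Q_P(A,A^c)$ (since $Q_P(A,A^c) = Q_{P^*}(A,A^c) = Q_{\frac{P+P^*}{2}}(A,A^c)$). In short,

$$\tilde\Psi_{min}\!\left(\frac{I}{2} + \frac{P+P^*}{4}\right) = \frac12 \min_{A \subset V} Q_P(A,A^c) = \frac12\,Q_{min}(P) .$$

Before applying the eigenvalue bounds proven earlier, note for a lazy walk that $\Psi(A) = Q(A,A^c)$, with
$Q(A,x)/\pi(x) < 1/2$ only if $x \in A^c$, and $> 1/2$ only if $x \in A$. It follows that if $\pi(A_u) > \pi(A)$ then $A \subset A_u$
and so $\pi(A_u) \ge \pi(A) + \pi_*$. Likewise, if $\pi(A_u) < \pi(A)$ then $A_u \subset A$ and so $\pi(A_u) \le \pi(A) - \pi_*$. Hence,
when studying a lazy walk (such as $\frac{I}{2} + \frac{P+P^*}{4}$), $A_{min}$ may be replaced by $\pi_*$ in our earlier analysis.
But $\pi_*\!\left(\frac{I}{2} + \frac{P+P^*}{4}\right) = \pi_*(P)$, and so the spectral bound follows from the earlier eigenvalue bounds. □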



4 Other distances 



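Total variation distance measures only distance from stationary at an average vertex. The much
stronger $L^\infty$ distance measures distance from stationary at the worst vertex. In this section we show
mixing bounds on $L^\infty$, $L^2$ and relative entropy distances which are again within a small constant
factor of those for the walk on a cycle with clockwise drift.

Given distributions $\sigma$ and $\pi$, the relative entropy distance $D(\sigma\|\pi)$, the $L^2$ distance $\|\sigma/\pi - 1\|_{2,\pi}$
and the relative pointwise (or $L^\infty$) distance $\|\sigma/\pi - 1\|_{\infty,\pi}$ are defined by

$$D(\sigma\|\pi) = \sum_{x \in V} \sigma(x)\log\frac{\sigma(x)}{\pi(x)}, \qquad \left\|\frac{\sigma}{\pi} - 1\right\|_{2,\pi} = \sqrt{\sum_{x \in V} \pi(x)\left(\frac{\sigma(x)}{\pi(x)} - 1\right)^2}, \qquad \left\|\frac{\sigma}{\pi} - 1\right\|_{\infty,\pi} = \max_{x \in V}\left|\frac{\sigma(x)}{\pi(x)} - 1\right| .$$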
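
These three distances are straightforward to evaluate. The following sketch assumes distributions are given as equal-length lists of floats with $\pi$ strictly positive, and uses the convention $0 \log 0 = 0$; the function names are illustrative.

```python
import math

def relative_entropy(sigma, pi):
    """D(sigma || pi) = sum of sigma(x) log(sigma(x)/pi(x)), with 0 log 0 = 0."""
    return sum(s * math.log(s / p) for s, p in zip(sigma, pi) if s > 0)

def l2_distance(sigma, pi):
    """|| sigma/pi - 1 ||_{2,pi}."""
    return math.sqrt(sum(p * (s / p - 1) ** 2 for s, p in zip(sigma, pi)))

def linf_distance(sigma, pi):
    """|| sigma/pi - 1 ||_{infinity,pi}, the worst relative pointwise error."""
    return max(abs(s / p - 1) for s, p in zip(sigma, pi))

# A point mass compared with the uniform distribution on 4 states.
sigma = [1.0, 0.0, 0.0, 0.0]
pi = [0.25, 0.25, 0.25, 0.25]
```

For a point mass against the uniform distribution on $n$ states the values are $\log n$, $\sqrt{n-1}$ and $n-1$ respectively, which shows how much stronger the $L^\infty$ distance is at the start of a walk.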



The worst case number of steps required for a walk to reach distance $\epsilon$ is given by the mixing times
$\tau_D(\epsilon)$, $\tau_2(\epsilon)$ and $\tau_\infty(\epsilon)$ respectively. These are related by $\tau(\epsilon) \le \tau_D(2\epsilon^2)$, $\tau(\epsilon) \le \tau_2(2\epsilon)$, $\tau_D(\epsilon) \le \tau_2(\sqrt{\epsilon})$,
$\tau_2(\epsilon) \le \tau_\infty(\epsilon) \le \tau_2\!\left(\epsilon\sqrt{\frac{\pi_*}{1-\pi_*}}\right)$, and $\tau_\infty(\epsilon) \le \tau_{2,P}(\sqrt{\epsilon}) + \tau_{2,P^*}(\sqrt{\epsilon})$ where $\tau_{2,K}(\epsilon)$ denotes $L^2$ mixing for
the Markov chain $K$ (e.g. see Appendix of [5] for proofs and/or references for these).

Evolving set bounds on relative entropy and $L^2$ mixing (see |S1 or [^) can be combined with the
technique used to prove Theorem 2.1 (see @]), to show

1 



'roo(e) < 



1 ^sin{7ra) 



1 



1 n 

'-'sin{7ra) 



log log — + log - 

vr* e 



1. l-TT, , ^ 1 

- log h log - 

2 vr* e 



1 — TT* 1 

log h log - 



1 Csin{7ra) \ '^*: £ 



The $L^\infty$ bound followed from the $L^2$ bound and the relation $\tau_\infty(\epsilon) \le \tau_2\!\left(\epsilon\sqrt{\frac{\pi_*}{1-\pi_*}}\right)$.

When $\epsilon \to 0$ these are asymptotically the same as the $\tau(\epsilon)$ bound of Theorem 3.1. However, when
$\epsilon$ is large we can further improve these via Evolving set bounds shown in Montenegro (see also ) :

Theorem 4.1. For a finite Markov chain, ifCf{r) = max7r(A)<r C/(j4) Vr G [0,1] then 

dr 



r{e) < 
TD{e) < 

T2(e) < 



(l-r)(l-C„(i_„)(r)) 

dr 

, rlog(l/r)(l -C„log(i/„)(r)) 

1/(1+^^) dr 



Ml-r-)(l-C^;^(r)) 

where the total variation bound requires (l — ^(i-a) (1 ~ ^)) be convex, relative entropy requires 
r (l — Caiog(i/a)(e~'")) to be convex, and the bound requires r ^1 — C (^j:^^^ to be convex. 

It remains to bound $\mathcal{C}_f(r)$ for each choice of $f(a)$ given above, then integrate in Theorem 4.1.
First, the bounds on $f$-congestion for the quantities used in the various distances of interest to us.



Lemma 4.2. 



1-Ca(l_a)(r) > 



2 Amin ^min 

r(l — r) 



Jr<l/2 



1 - (-'alog(l/a)V) > -TTT'-i r' "r<e-l/2 + elXmin^min 

Proof. Montenegro ^3; uses Lemma 12. 21 and optimization similar to the use of Lemma 14.41 in the proof 
of Theorem 12.11 to lower bound the /-congestion quantities in terms of ^{A): 



^-Ca(i-a){r) > mm 
^-Ca\og(i/a){r) > jam 

1-C 



7rlA)<r Tt{A)tt{A'') 

2^(A)2 



4^^ . 

^ mm 



' - ^(A)<r 2^(^)2^(^^)2 - 2r2(l-r) 



^(A)<r 7r(A)2 log ^ 
^(^)2 



r(l — r) 



2^2 . 



r2 log ; 



r 

^Ln 2 
Improvements are again possible for max-degree type walks. If $\tilde\Psi_{min} < A_{min}/2$ then the worst
case $m(u)$ was already determined in the proof of Theorem 3.1 (recall that Lemma 2.2 says $m(u)$ is
the worst for every concave function $f : [0,1] \to \mathbb{R}$). Notice that the upper bound on $\mathcal{C}_{\sin(\pi a)}(A)$

could have been obtained by taking



l-C/(a)(^)>2|^ 
'—^m.in. 



1 - C 



/(a)(^(^))U(A). 



./2 



In short, the case with ^rnin ^ ^rnin/'^ can be reduced to the case of ^min ^ ^min/'^^ without losing 



any accuracy. However, bounds in terms of ^min were found at the beginning of this proof, so it 
suffices to replace ^{A) by Amj„/2 in these bounds, and then multiply by 

Combining the two cases ^'(^4) > Amm./2 and '^{A) < Amm/2 gives the lemma. □ 



Applying Theorems 4.1 and 2.1 to the $f$-congestion bounds of Lemma 4.2 leads to the following
relations:



Corollary 4.3. For a finite Markov chain, if 



min ^(A) and Ar. 

Acy 



max{^'^ 



^} then 



1- lAi 



re 



> 
< 



le^'min^ 

l-47r^ 



mm-^max 
2 



16^minAr, 



8^min.A^ 



mm-^max 



roie) < 



r2ie) < 



Note that ^'^m > ^mm, 



1 - e7r2 + log ^ 



-2e 



"^(i^minAmax 



t<l/2 



^^minAmax 



8^- 



l + 3e2 



mm-^max 



6^'mm^max(l + £ 



^e>l/2 
\>l/2 



2\3 



and so even the total variation bound can occasionally improve on
Theorem 3.1. For instance, the simple random walk on an expanding Eulerian graph has $\tilde\Psi_{min} \ge 1/m$,
and so the bound here is better than that of Corollary 1.1 when $m$ is large and $\epsilon > 0.361$. This explains
why our upper bound on mixing time was not quite sharp before.

Proof of $L^\infty$ cases in Corollaries 1.1 and 1.2. We use the relation $\tau_\infty(\epsilon) \le \tau_{2,P}(\sqrt{\epsilon}) + \tau_{2,P^*}(\sqrt{\epsilon})$.
First, Corollary 1.2. If $A, B \subset V$ and $\pi(B) = \pi(A^c)$ then

$$Q_P(A,B) = \pi(B) - Q_P(A^c,B) = \pi(B) - \pi(A^c) + Q_P(A^c,B^c) = Q_{P^*}(B^c,A^c) . \qquad (4.1)$$

Since $\pi$ is uniform for the max-degree walk then $\tilde\Psi(A) = \min_{\pi(B) = \pi(A^c)} \tilde Q(A,B)$, and so by (4.1) $\tilde\Psi_{min}$
is the same for $P$ and $P^*$. Then $\tau_\infty(\epsilon) \le 2\tau_2(\sqrt{\epsilon})$ if the $L^2$ mixing bound from Corollary 4.3 is used,
that is, if we substitute the conditions $A_{min} \ge 1/n$, $\tilde\Psi_{min} \ge 1/nd$ and $A_{max} \ge 1/2n$.

Now to Corollary 1.1. If $A, B \subset V$, $v \in V$ and $\pi(B) \le \pi(A^c) < \pi(B \cup v)$ then, arguing as in (4.1),

$$Q_{P^*}(A,B) + \frac{\pi(A^c) - \pi(B)}{\pi(v)}\,Q_{P^*}(A,v) = Q_P(B^c \setminus v, A^c) + \left(1 - \frac{\pi(A^c) - \pi(B)}{\pi(v)}\right) Q_P(v,A^c) . \qquad (4.2)$$

Arguing as in the proof of Corollaries 1.1 and 1.2 after Theorem 3.1, if $\pi(C) \le 1/2$ and $\pi(D) \le \pi(C^c) <
\pi(D \cup v)$ then $Q(C,D) \ge 1/m$. Note that for some $A, B, v$ with $\pi(A) \le 1/2$, equation (4.2) is exactly
$\Psi_{min}$ for the walk $P^*$, and so if $C = B^c \setminus v$ and $D = A^c$ then it follows that $\Psi_{min} \ge Q_P(B^c \setminus v, A^c) \ge 1/m$.
Hence $A_{min} \ge 1/m$, $\tilde\Psi_{min} \ge 1/m$ and $A_{max} \ge 1/2m$ for both $P$ and $P^*$. The $L^\infty$ case in Corollary
1.1 follows by the resulting bounds on $\tau_{2,P}(\sqrt{\epsilon})$ and $\tau_{2,P^*}(\sqrt{\epsilon})$ in Corollary 4.3. □
Appendix

In the Appendix we look at two results needed in this paper. First, the proof of Theorem 2.1 and
then the proof of an inequality used in showing Theorem 3.1.

Proof of Theorem 2.1. Given $x \in V$, the Evolving set process is defined recursively by setting $S_0 =
\{x\}$, and then to determine $S_{t+1}$ choose $u \in [0,1]$ uniformly at random, and set $S_{t+1} = (S_t)_u$. Let
$E_t$ denote the expectation after $t$ steps of the Evolving set process. Also, we use the notation $S^\#$ to
denote $S$ if $\pi(S) \le 1/2$ and $S^\# = V \setminus S$ if $\pi(S) > 1/2$.

Starting with an inequality of [HI (see also [3]), we have

||P*(:c,-)-vr||Ty < Et7r(5t)(l - ^(^t)) 

1T[X) 

< ( m» =ti)^)^E,/W5f)) 

I AcV, /(vr(A)) I 7r(x) 

\7r(A)<l/2 / 
\7r(A)<l/2 / 

< ( TTjAMA^) ] fini{x}#)) , 
- S /(vr(A)) vr(.) 

\7r(A)<l/2 / 

The final inequality followed from $\mathcal{C}_f(S_{t-1}^\#) \le \mathcal{C}_f$, and then induction.

The lower bound on total variation distance can be found in [2] (see also IS]).
The bound on eigenvalues follows by combining the upper and lower bounds:



|A,;| < 



2C* max ^^tmiMi^t^c, 

7r(A)<l/2 
□



We have left for the Appendix the proof of an inequality key to our main theorem. 
Lemma 4.4. If < a < ^, < b < 1/2 and < c < b{l - b) then 



$$h(a,b,c) := b\,\sin\!\left(\pi\left(a + \frac{c}{b}\right)\right) + (1-b)\,\sin\!\left(\pi\left(a - \frac{c}{1-b}\right)\right) \le \sin(\pi a)\cos(2\pi c) .$$

Proof. First, use the expansion $\sin(x+y) = \sin(x)\cos(y) + \sin(y)\cos(x)$ to re-arrange the terms a bit:

$$h(a,b,c) = \sin(\pi a)\left[\, b\cos\frac{\pi c}{b} + (1-b)\cos\frac{\pi c}{1-b} + \cot(\pi a)\left( b\sin\frac{\pi c}{b} - (1-b)\sin\frac{\pi c}{1-b} \right)\right] \qquad (4.3)$$



Consider the second term. Suppose $b \in [c, 1]$. Then

$$\frac{d}{db}\, b\sin\frac{\pi c}{b} = \sin\frac{\pi c}{b} - \frac{\pi c}{b}\cos\frac{\pi c}{b} = \cos\frac{\pi c}{b}\left(\tan\frac{\pi c}{b} - \frac{\pi c}{b}\right) \ge 0 .$$

The inequality is because $(\cos x)(\tan x - x) \ge 0$ when $x = \frac{\pi c}{b} \in [0,\pi]$.
It follows that $b\sin\frac{\pi c}{b}$ is increasing, and in particular if $b \le 1/2$ then

$$b\sin\frac{\pi c}{b} - (1-b)\sin\frac{\pi c}{1-b} \le 0 .$$

Consequently, if $b$ and $c$ are fixed then the ratio $\frac{h(a,b,c)}{\sin(\pi a)}$ is maximized when $a \in [0,1/2]$ is maximized.

Subject to the conditions on $a$, $b$ and $c$ in the lemma this maximum is at $a = \min\{1/2, 1 - c/b\}$.



It has just been shown that if 6 > 2c then ^^"-j^'^} < ^(^/^'^'^) 



h(l-c/b,b,c) 



sin(7ra) 



sin(7r/2) 



h{l/2,b,c), otherwise — 



. The latter case can be simplified further by the relation 



sin(7r(l— c/fe)) 

b < 2c. This requires showing that 



h{l~c/b,b,c) 
sin(7r(l— c/fe)) 



h{a,b,c) 
sin(7ra) 



< 



< h{l/2,2c,c) when 



(1-6) sin 



vrc 



vrc vrc 

6(i-6)^('-'^)^^"T^°^T3^ 



where | < c < 6(1 — 6). The substitutions x = ^p^^ and y = ^ (ie. 6=1 — y/x and c = nHzlIll'^ 
reduce this to Lemma 14.51 and so the inequality holds. 

Combining results, we now know that $\frac{h(a,b,c)}{\sin(\pi a)} \le h(1/2, \max\{2c,b\}, c)$. The second term in Equation
(4.3) is zero when $a = 1/2$, and so to study $h(1/2, \max\{2c,b\}, c)$ it suffices to consider the first term
in Equation (4.3). Note that

$$\frac{d}{db}\, b\cos\frac{\pi c}{b} = \cos\frac{\pi c}{b} + \frac{\pi c}{b}\sin\frac{\pi c}{b}, \qquad \frac{d^2}{db^2}\, b\cos\frac{\pi c}{b} = -\frac{(\pi c)^2}{b^3}\cos\frac{\pi c}{b} .$$

The second derivative is negative when $b \ge 2c$, and so $g(b,c) = b\cos\frac{\pi c}{b}$ is concave in $b \ge 2c$. It follows
that if $1/2 \ge b' \ge \max\{2c,b\}$ then $h(1/2, \max\{2c,b\}, c) = g(\max\{2c,b\}, c) + g(1 - \max\{2c,b\}, c) \le
g(b',c) + g(1-b',c) = h(1/2, b', c)$, and in particular,

$$h(1/2, \max\{2c,b\}, c) \le h(1/2, b', c) \le h(1/2, 1/2, c) . \qquad (4.4)$$

The result then follows. □

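The following Lemma was required in the preceding proof.

Lemma 4.5. If $\pi/2 \le y \le x \le \pi$ and $\mathrm{sinc}(z) := \frac{\sin z}{z}$ then

$$\mathrm{sinc}(y)\left(1 - \frac{2}{\pi}\,y\left(1 - \frac{y}{x}\right)\right)\cos\!\left(\frac{y\left(1 - \frac{y}{x}\right)}{1 - \frac{2}{\pi}\,y\left(1 - \frac{y}{x}\right)}\right) \ge \mathrm{sinc}(x) .$$

The function $\mathrm{sinc}(y)$ is decreasing when $y \in [\pi/2,\pi]$, and so the lemma says that the term after
$\mathrm{sinc}(y)$ is a measure of how much the function drops between $y$ and $x$. A slightly weaker result that
is perhaps a bit easier to grasp is

$$\forall x,y \in [\pi/2,\pi],\ x \ge y :\ \mathrm{sinc}(x) \le \mathrm{sinc}(y)\cos(2y(1 - y/x)) .$$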

Proof. Rewrite the problem as 



f{x,y) 



V TT V xJJ X smy 



Observe that $\forall x \in [\pi/2,\pi] : f(x,x) = f(x,\pi/2) = 0$, and so the lemma holds at the two extreme
values for $y$, that is $y = \pi/2$ and $y = x$. Moreover, given fixed $x \in [\pi/2,\pi]$, the first partial with
respect to $y$ at $y = \pi/2$ is positive:



9/ 
dy 



2x{Tr — x) cos(7r — x) — {1 — 2(7r — x)) sin(7r — x) 

y=n/2 

. f , 2x(7r - x)(l - f (tt - x)) - (1 - 2(7r - x)){7r - x) 
> mm < 0, 

xe[!r/2,Tv] I TTX 

1 TT — X f , Axin — x) , , 
min <^ 0, 27r - 1 ^ ] '> = . 



xe[n/2,iT] [ nx \ TT 

The first inequality is because the expression is trivially positive if $\pi - x \in [1/2,\pi/2]$, whereas if
$a := \pi - x \in [0,1/2]$ then use the relations $\cos a \ge 1 - a^2/2$ and $\sin a \le a$. It follows that the inequality
$f(x,y) \ge 0$ also holds near $y = \pi/2$.

Now, consider the third partial derivative with respect to $y$:

d^f _ 'K^x'^{2y-xf . 2/(1- I) 

sm ■ 



5y3 (7rx-2xy + 2y2)5 1 _ |y (1 _ |) 

67r^x(2y - x)(x(7r - x) + 2y{x - y)) y (l - |) 

COS 



(ttx - 2xy + 2y2)4 1 _ |y (1 _ |) 

6y sin X o 6 sin x 9 5v sin x 3 sin x 

+— cof^y cot^y+ — coty 

X sm y X sm y x sm y x sm y 

< 
The third derivative was negative because every term in it is negative when $\pi \ge x \ge y \ge \pi/2$ (note
that $\frac{y(1 - y/x)}{1 - \frac{2}{\pi} y (1 - y/x)} \in [0,\pi/2]$ and $\pi x - 2xy + 2y^2 > 0$).

From the third partial we know that the second partial is decreasing, and so for each $x \in [\pi/2,\pi]$
there are three possible cases: strictly convex in $y$, convex then concave in $y$, or strictly concave in $y$.
The function cannot be strictly convex because it is zero at $y = \pi/2$, then increasing, and a convex
function could not then be zero again at $y = x$. If it is convex then concave, then the convex portion
is strictly increasing because $\frac{\partial f}{\partial y}(x,\pi/2) > 0$, while the concave portion starts at a positive value and
ends at $f(x,x) = 0$. Hence the minimum of the convex portion is at $y = \pi/2$, and the minimum of
the concave portion is at $y = x$. Finally, if it is strictly concave then the minimum is at an endpoint,
so either $f(x,\pi/2)$ or $f(x,x)$.

It follows that for each value of $x \in [\pi/2,\pi]$ the minimum is either $f(x,\pi/2) = 0$ or $f(x,x) = 0$.
Hence the function is non-negative. □